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Calibrating a first and a second microphone 



The invention relates to a method of calibrating a first microphone and a 
second microphone, comprising 

an acquisition step in which a first input audio signal is acquired by means of 
the first nucrophone and a second input audio signal is acquired by means of the second 
5 microphone; 

a calibration step in which a first sensitivity of the first ndcrophone and a 
second sensitivity of the second microphone is determined. 

The invention also relates to an apparatus comprising a first microphone and a 
second microphone for acquiring a first and a second input audio signal respectively, and a 
10 processor for determining a first sensitivity of the first microphone and a second sensitivity of 
the second microphone. 

The invention also relates to a computer program for execution by a processor, 
comprising program code for calibrating a first microphone and a second microphone, 
comprising 

IS- an acquisition step in which a first input audio signal is acquired by means of 

the first microphone and a second input audio signal is acquired by means of the second 
microphone; 

a calibration step in which a first sensitivity of the first naicrophone and a 
second sensitivity of the second microphone is determined. 
20 The invention also relates to a data carrier storing a computer program for 

execution by a processor, comprising program code for calibrating a first microphone and a 
second microphone, which method comprises 

an acquisition step in which a first input audio signal is acquired by means of 
the first microphone and a second input audio signal is acquired by means of the second 
25 microphone; 

a calibration step in which a first sensitivity of the first ntiicrophone and a 
second sensitivity of the second microphone is determined. 
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An apparatus for calibrating microphones is known from WO-A-0201915. The 
known apparatus has a multitude of microphones and is useful for e.g. teleconferencmg. The 
multitude of microphones enables better capture of the speech of a speaker, which leads to 
hi^er intelligibiUty at the receiver side. Algorithms for exploiting the multitude of 
microphones require an accurate calibration of the noicrophones. This can be done in the 
£Etctory in an anechoic chamber, but this is expensive. The known s^aratus performs 
calibration after purchase, which enables the coimection and the calibration of additional 
microphones on the fly. The disadvantage however is that the sensitivity of the microphones 
is determined as the relation between a predet^mined acoustical input signal applied to the 
microphones and a measured electrical output signal from the microphones. 



It is a first object of the invention to provide a method of the kind described in 
the opening paragraph, for calibrating at least two microphones, which is versatile in its use. 

It is a second object of the invention to provide an apparatus of the kind 
described in the op^iing paragraph, which is versatile in its use. 

It is a third object of flie invention to provide a computer program for 
execution on a processor comprising program code coding the method according to the 
invention. 

It is a fourth object of the invention to provide a data carrier storing a 
computer program according to the invention. 

The first object is realized in that in the calibration step an algorithm is applied 
which enables determination of the sensitivities, in the absence of a loudspeaker for the 
generation of the input audio signals. To determine the sensitivities in the known apparatus, a 
loudspeaker is required, which emits a prespecified sound, which serves as the acoustical 
input for the microphones. In the method of the present invention the calibration is performed 
by using an algorithm which allows the microphones to be calibrated with naturally present 
sound, such as speech from a person- e.g. the perspn executing the method- or a sound picked 
\xp on the street. This makes the method, and the apparatus applying the method, more 
employable to practical usage cases, since it avoids carrying around a loudspeaker. 

In an embodiment of the method, the first and tiie second input audio signal 
are processed by an adaptive beamforming filter, and the sensitivities are determined by 
performing a calculation with weights of the adaptive beamforming filter. Beamforming is a 
widely used algorithm for obtaining an increased sensitivity in the direction of a speaker. 
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and/or a reduced sensitivity in the direction of a source of noise, by making use of the input 
signals of a number of microphones. In the embodiment, use is made of the fact that the 
sensitivities of the microphones can be inferred from the coefi&cients of the filters used by the 
beamformer. 

5 In a more specific embodiment of the method, the algorithm comprises 

calculating 




, in which is the discrete Fourier transform of the weights of the 



beamforming filter after adaptation, and the sum ranges over a predetermined number L of 
frequencies Qj^ . Performing a calculation in the Fourier domain makes the determination of 

10 the s^isitivities more robust. 

The second object is realized in that the processor is able to determine tiie 

sensitivities, in the absence of a loudspeaker for generating the input audio signal. Often the 

microphones are integrated in an apparatus which is able to calibrate itself, such a 

teleconferencing apparatus. 
1 5 The tiiird object is realized in that in the calibration step an algorithm is 

applied which enables determination of the sensitivities, in the absence of a loudspeaker for 

generating the input audio signals. 

The fourth object is realized in that in the calibration step an algorithm is 

applied which enables determination of the sensitivities, in the absence of a loudspeaker for 
20 generating the input audio signals. 

These and other aspects of flie method, the apparatus, the computer program 
and the data carrier according to the invention will be apparent from and elucidated with 
25 reference to the implementations and embodiments described hereinafter, and vidth reference 
to the accompanying drawings, which serve merely as a non limiting illustration. 
In the drawings : 

Fig. 1 schematically shows a teleconferencing session; 

Fig. 2 schematically shows the method of calibrating a first and a second 

30 microphone; 

Fig. 3 schematically shows a beamforming apparatus; 

Fig. 4 schematically shows a microphone calibration apparatus of the prior art; 
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Fig. 5 schematically shows an apparatus for relative calibration of a first and a 
second microphone according to the invention; and 
Fig. 6 shows a data carrier. 

In these Figm^s elements drawn dashed are optional depending on the desired 

embodiment. 



In Fig. 1, a teleconferencing session is shown. A locally present person 107 is 
commmiicating with a remote person 109, who is e.g. shown on a display 111. For the 
commmiication of speech, an audio communication device is required, which is represented 
by the console 101. It can contain e.g. buttons, a small status display, a loudspeaker for the 
reproduction of speech uttered by the remote person 109, and a microphone. Practice shows 
that the locally present person 107 has to be very close to the microphone in the console 101, 
if he wants that the remote person 109 understands what he is saying. It is much more 
practical if the locally present person 107 can reside anywhere he likes. To achieve this, more 
than one microphone is used, illustrated with the first microphone 103 and the second 
microphone 105 in Fig. 1. Techniques have been developed to take advantage of the spatial 
arrangement of multiple microphones, in order to better capture the speech of a speaker. The 
beamforming technique is explained by means of Fig. 3. There are numerous applications 
which benefit fix)m beamfomiing and more particular ficom the method for the relative 
calibration of at least two microphones described in this text. One example is voice control. 
E.g. a television set could be equipped with a remote control based on keywords. 
Beamforming helps in decreasing the keyword recognition failure rate. Also portable devices 
can be equipped with more than one microphone. 

In Fig. 2, a first input audio signal ul, coming firom a first microphone 205, 
and a second input audio signal u2, coming fi'om a second microphone 207, are acquired 
during an acquisition step ACQ. Both iaput audio signals ul, u2 are used in the calibration 
step CAL to determine a first sensitivity al of the first microphone 205 and a second 
sensitivity a2 of the second microphone 207. 

Fig. 3 shows an apparatus 241 which is able to apply filtered-simi 
beamforming to the output of a number of microphohes, e.g. three. A first sound source is 
speech fi^om speaker 201. Suppose that the speech contains a single wavelength, and that a 
speech wavefiront 233 is planar and parallel to an imaginary line nmning througih the first 
microphone 205 and the second microphone 207. Each miax>phone then picks up the same 
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sound signal. The first microphone 205 converts the sound into a sampled first electrical 
audio signal ul, and the same applies to the second microphone 207 and fiirther n^crophones 
if present. If the sensitivities of the microphones 205 and 207 are equal, the sampled 
electrical audio signals ul and u2 are equal. Suppose further that a second sound source 203 
produces e.g. music of a single wavelength, with planar wavefronts which impinge on the 
microphone array imder an ang^e 9. Then a music wavefix)nt 23 1 arrives at the first 
microphone 205 earlier than at the second microphone 207. This implies that the electrical 
audio signals ul and u2 are samples at different phases of a sinusoid of a certain spatial 
wavelength , which is related to the direction of incidence 6 and the wavelength X of the 
music of the second sound source 203. A highpass spatial filter can be designed which 
transmits the speech firom speaker 201 with an infinite spatial wavelength , but blocks the 
undesired interference firom the second sound source 203. A spatial filter consisting of a 
single multiplication coefficient for each ndcrophone is sufficient for fixed position, single 
wavelength sound sources. 

For broadband sound sources emitting more wavelengths, a temporal filter is 
placed behind each microphone instead of a single multiplication coefficient. E.g. a first 
temporal filter 221 filters the electrical signal ul of the first microphone 205. Successive 
samples of ul are delayed by delay elements, like a first delay element 227, and the delayed 
samples are multiplied by filter coefficients, like a second filter coefficient 228, and added 
together by adders, like a first adder 229. The number of filter coefficients is dependent on 
how many samples firom a sound signal are desirable and on how many computing resources 
are available. The outputs of the temporal filters 221 and 223 are summed by the spatial 
summation 230, to obtain a spatiotemporal filter output z. Spatiotemporal filter 240 can be 
described mathematically by means of equation [1]: 

^(«) = ZZ [1] 

In equation [1], describing a filtered sum beamformer, n is a discrete time 
index, 1 is an index of a filter coefficient w, T is a time difference between samples, and m is 
a microphone index corresponding to one of the microphones (205 and 207) and temporal 
filters (221 and 223). 

In order to properly filter out the audio signal of the interfering second soxmd 
source 203, the filter coefficients have to get the appropriate values during an adaptation 



wo 2004/025989 PCT/IB2003/003499 

6 

phase of beamforming. If adaptation applies e.g. an algorithm which maximizes the power of 
z(n), imder the constraint that for all frequencies Cl^ the following condition is satisfied : 

^cici.y [2], 

in which W„(Clf^) is the discrete Fourier transform of the filter (w) and C a 
constant, then the optimal filter coefficients after adaptation satisfy equation [3]: 

W:i£i,)=a(Ci,)C(Ci,)-=S^Bd==r [3]. 



V 



In equation [3], i?^ (Qj^ ) is Ihe conq)lex conjugate of the discrete Fourier 

transform of the acoustic impulse response for fhe microphone with index m. E.g. the first 
acoustical impulse response hi in Fig. 2 is the acoustic impulse response modeling the sound 
10 transfer from the speaker 201 to the first microphone 205. a(Qjt ) is an all-pass term which is 
common to aU temporal filters 221, 223 and 225. 

For a pla3tiar wavefront like the music wavefit>nt 23 1, the acoustic transfer 
fimctions are propagation delays, and hence for each frequency k and microphone index m, 
equation [4] applies: 

15 \H„i£l,f=l [4]. 

hi a reverberant room this model is too simple. The sound traveling directly 
from e.g. speaker 201 to e.g. the first microphone 205, can interfere constructively or 
destructively with e.g. a first reflection of the sound from speaker 201 on a nearby wall. This 
can imply that e.g. at the position of tiie first microphone 205 there is hardly any sound 
20 power present at frequency . It is very unlikely that the interference should occur for all 

possible frequencies Q^^ at the spatial position of a ntucrophone, e.g. the first microphone 
205. Hence equation [5] is highly likely to be vaUd: 



1 2N^\ * 

Using equation [5], which guarantees that the sound from speaker.201 is 
25 transferred q>proximately equally to the microphones 205 and 207, it can be proven that tiie 
relative sensitivities a„ of eachmiax>phone follow from equation [6]: 



'W§^ [61. 
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Hence it is possible to introduce correction factors - 21 1 and 213 in Fig. 2 - 
which mxiltiply witii the electrical audio signals ul and u2, in order to make the microphones 
205 and 207 equally sensitive. These correction factors can be calculated as in equation [7]: 

fc„=-f [7], 

m 

5 in which c is a constant. 

The spatiotemporal filter 240 unplementing filtered sum beamfonner [3] 
identifies the acoustic transfer fimctions between a sound source measured during calibration 
and the microphones upto an unknown error factor which is common to all microphones. The 
feet that the error fector is common to all microphones allows calibration of the microphones 

10 relative to each other. 

If the speaker is a certain distance away from the microphones, e.g. sittmg in a 
chair watching a television with multiple microphones for voice command, the method works 
well because the acoustic impulse responses hi and h2 are similar to propagation delays, 
which means that all microphones receive essentially the same sound as input. If there is a 

15 strong reverberation firom e.g. a nearby wall to a microphone at certain firequencies, the 

method may work less well. Pathological frequency regions can be discarded by modifying 
the algorithm, by using equation [8] in place of equation [6]: 



The sum in equation [8] is taken over a nimiber i of frequency intervals 
20 [fc^,*,+i],inwhiche.g.nospuriouslylargevaluesof occur. If the sum covers 

enough frequencies , is still a reliable measure of the relative sensitivity of the m-th 
microphone. is the total number of frequencies in all the intervals [k^.k^^^ ] together. To 
increase accuracy, it is advantageous to also drop the lowest and highest frequencies from the 
svimmation, since some of the microphones can have a spurious behavior in these frequency 
25 regions. 

Fig. 4 shows a microphone calibration apparatus of the prior art. An electrical 
loudspeaker audio signal e is sent fix)m a signal source 304 to loudspeaker 301, in which it is 
converted to sound 302, which is picked up by microphone 303. Microphone 303 converts 
the sound to an electrical microphone audio signal s. In the known apparatus it is required 
30 that both the loudspeaker audio signal e and the microphone audio signal s, are sent to a 
processor 305, which is able to determine microphone sensitivity 307 fix)m the two audio 
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signals. In the current invention no loudspeaker audio signal e is required by the calibration 

algorithm. The input of a sound source like speaker 201 is sufficient. 

Fig. 5 shows an s^paratus 401 for relative calibration of a first and a second 

microphone 403 and 405 according to the invention. A processor 407 has access to a first 
5 audio signal fix)m a first microphone 403 and a second audio signal fi:om a second 

microphone 405. It is possible to run an algorithm according to tiie invention, as e.g. 

illustrated with Fig. 3, on the processor 407, which e.g. calibrates the microphones 403 and 

405 after a certain amount of time to counteract time varying effects like e.g. component 

aging or temperature related efTects. Another option is that a user pushes e.g. a button 409 
10 and initiates the calibration, e.g. every time he takes the apparatus into a different room 

which has different acoustic impulse responses. 

An interesting option is to calibrate only when the sound conaing into the 

microphones is speech, by adding a speech detector. 

Fig. 6 shows a data carrier for storage of a computer program for execution on 
15 a processor describing a method according to the invention for calibrating a fixst and a second 

microphone. 

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention and that those skilled in the art are able to design alternatives, without 
departing firom the scope of the claims. Apart from combinations of elements of the invention 

20 as combined in the claims, other combinations of the elements within the scope of the 
invention as perceived by one skilled in the art are covered by the invention. Any 
combination of elements can be realized in a single dedicated element. Any reference sign 
between parentheses in the claim is not intended for limiting the claim. The word 
"comprising" does not exclude the presence of elements or aspects not listed in a claim. The 

25 word "a" or "an" preceding an element does not exclude the presence of a plurahty of such 
elements. 

The invention can be implemented by means of hardware or by means of 
software ruiming on a computer. 



